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Abstract 


Conformal radiotherapy planning needs accurate delineations of the critical structures. Atlas-based 
segmentation has been shown to be very efficient to delineate brain structures. It would therefore 
be very interesting to develop an atlas for the head and neck region where 7 % of the cancers arise. 
However, the construction of an atlas in this region is very difficult due to the high variability of 
the anatomies. This can generate segmentation errors and over-segmented structures in the atlas. 
To overcome this drawback, we present an alternative method to build a template locally adapted 
to the patient's anatomy. This is done first by selecting in a database the images that are the most 
similar to the patient on predefined regions of interest, using on a distance between 
transformations. The first major contribution is that we do not compute every patient-to-image 
registration to find the most similar image, but only the registration of the patient towards an 
average image. This method is therefore computationally very efficient. The second major 
contribution is a novel method to use the selected images and the predefined regions to build a 
Frankenstein's creature" for segmentation. We present a qualitative and quantitative comparison 
between the proposed method and a classical atlas-based segmentation method. This evaluation is 
performed on a subset of 58 patients among a database of 105 head and neck CT images and 
shows a great improvement of the specificity of the results. 





1 Introduction 


Conformal radiotherapy allows to precisely target the tumor while keeping an acceptable 
level of irradiation on neighboring critical structures. This however requires to accurately 
locate the tumor and the organs at risk in order to determine the best characteristics of the 
irradiation beams. This delineation task is usually done manually and is therefore very long 
and not reproducible. 


To solve this problem, atlas-based segmentation has been shown to produce accurate and 
automatic segmentations of the brain [1], allowing to take into account easily the relative 
positions of the structures. The construction of an atlas for other regions such as the head 
and neck region, where 7 96 of the cancers arise, is therefore of great interest [2, 3]. Methods 
have been presented for the construction of an unbiased average model from an image 
dataset such as [4, 5]. We presented in [6] a method to build a symmetric atlas from a 
database of images manually delineated following the guidelines in [7]. However, 
anatomical variability is very high in the head and neck region. An average atlas may 
therefore be very different from the patient, leading to registration discrepancies. Moreover, 
this variability may cause the mean contours to be too large in the atlas yielding over- 
segmentations. 


1duosnue|y Jouiny Vd-HIN 1duosnue|y Jouiny Vd-HIN 


1duosnue|N Jouinv Vd-HIN 


Commowick et al. 


2 Method 


Page 2 


To overcome these drawbacks, methods have been presented towards the creation of atlases 
whose anatomy is adapted to the patient [8, 9]. First, Blezek et al. [8] presented an 
interesting approach to cluster a database into several atlases representing homogeneous 
sub-populations. However, the selection of the most adequate atlas with respect to a given 
patient is not addressed. Another method [9] has been introduced to select the most similar 
images to the patient by comparing a similarity measure between each database image and 
the patient. This method is however computationally expensive, requiring to register all the 
database images on the patient. Moreover, a local comparison of the images is more adapted 
in our case, as our database consists of manually delineated patients who present pathologies 
that may corrupt a global comparison. 


In this paper, we present the development of an atlas locally adapted to the patient to get 
more precise delineations than with an average atlas. To this end, we first present a new and 
efficient method to select the most similar images to the patient on predefined regions. Each 
most similar sample is defined as the one that needs the smallest local deformations to be 
registered on the patient. These images are then combined into a template image, as an 
analogy to Frankenstein’s creature [10], and used for segmentation. 


We first present our approach to select the image that is the most similar to the patient Pto 
delineate on a given region. We then focus on the combination of the local templates 
selected into one single template for delineation. Finally, we show qualitative and 
quantitative results on a database of 105 head and neck CT images, showing a great 
improvement of the specificity of the results. 


In this section, we present an efficient method to compute a template that is similar to the 
patient Pon predefined regions R; based on the following steps: 


- Construction of an average atlas M from the database images (pre-computed) 
- Non linear registration of the patient Pto delineate on M 
- Selection of the most similar image Î for each local region R} 


- Computation of the anatomy Mand segmentations from the set of 7; 


2.1 Selection of the Locally Most Similar Images to a Patient 


For each region of interest R}, we select the most similar image among our database to the 
patient to delineate. It is defined as the one that is the “less” deformed to be non linearly 
registered on the patient. We denote by 75. 4 the transformation linking two images A and 
B, so that B can be resampled on A (i.e. A ~ BO Tp —,). The selection for a given region R; 
is then based on a comparison of the non linear transformations 77... pto bring each image J; 
of the database on P, i.e. the most similar image is defined as: /;= arg min, dg GJ P) = arg 
min Ij dgy(Tj Tj—P Id), where dg, will be defined later on. However, this type of comparison is 
computationally very expensive as it requires to perform all the registrations between Pand 
the images J; for each patient to segment. To perform an efficient selection, we therefore use 
an intermediate image: an average atlas M pre-computed from the database using [6]. 


From the average atlas construction, we obtain for each image J;an affine transformation 
Ar..—M and a non linear transformation 77. 4; bringing it on the average image M. 
Moreover, when registering Pon M, another non linear transformation 7p.—,is computed. 


The key hypothesis is then to assume that 77. pcan be approximated by 77.—4;O T 


P—M 
This hypothesis presents many advantages. first, the regions of interest R;can be defined 
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once and for all on the atlas image M. Moreover, this can be done very easily thanks to the 
average segmentations available on the atlas. Also, the similarity between Pand J, dr, 
(Tj Ij—P Id), can be approximated using the following equation: 


d, (T, íoT, ls l= M Mog(T,. oT. GI 
ieRioT, y 


(1) 


where / corresponds to the voxels of the dense transformation and dr, is the Log-Euclidean 


distance on diffeomorphisms [11] between the identity transformation and 77.—4;,O I An 
Using our hypothesis, we need to perform only one non linear registration between M and P 
to select the locally most similar images /;for all regions R; therefore reducing drastically 


the computation time. 


2.2 Piecewise Most Similar Atlas Construction 


We now focus on the computation of a template for segmentation from the selected images 
I; and the regions R; This template, similar to the Frankenstein's creature [10], is built by 
iterating over the following steps to combine the images: 


- Registration of the images 7;on the average image at iteration k: My 
- Compute the new average image Mp1, based on the regions R; k 


- Compute an average transformation 7; from the transformations 7| TM 
Apply T. to My] to get the new reference Mı = Mg O T. 


Update the regions of interest by applying T to Rik: Rig = RKO T. 


This algorithm can be seen as an extension of [4] to the construction of an atlas where 
images have spatially varying weights, depending on the regions R; In contrast, Guimond et 
al. consider implicitly that all images have equal and spatially constant weights (1/N for 
each image if N images are averaged). The final step is then to associate a set of 
segmentations to this anatomy. This is done by transforming the manual segmentations of 
the image 7; present in the region R,onto M, using the transformations obtained in the 
construction process, and ensuring that no overlap exists between the final structures. 


Average Image Computation—To compute Mj, from the images Î registered on Mp 
we first need to define the spatial extensions of each region at iteration k: R74. This will 
allow to use, on each R;x, only the corresponding I; and to interpolate between the regions. 
The weight functions w; (x) are computed in three steps: locally erode the regions A; using 
the method presented in [12] to ensure a minimal distance between them, compute the 
inverse of the minimal distance to each Aj w; (x) = l/(1 + adist(x, Rj4)), and normalize 


the wi GO3R G9 Q9/ 377 wO, 


The images /;are then aligned onto Mẹ first globally resulting in affine transformations 
Aj;<Mp and then non linearly producing dense transformations 77. 7, These 
transformations and the w; (x) are then used to compute My ;: 


L 
Mi=) Wi) (hed, V oT, 4 )69 @ 


Ift, 
=I 
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Residual Deformation Computation—Similarly to [4], the next step consists in 
averaging the 77.—,7, into a transformation T and apply its inverse to Mj, to get the new 


reference M44 = Mya. O L. However, we are averaging transformations using spatially 
variable weights w; (x). To take this into account and ensure that 7; is a diffeomorphism, 
we introduce a generalization of the Log-Euclidean (LE) polyaffine transformation to 
diffeomorphisms, as suggested in [13]. The polydiffeomorphism construction is based on the 
LE framework for diffeomorphisms [11], allowing to compute operations easily while 
staying on the manifold of diffeomorphisms. 7; is then built by integrating between time 0: 
x(0) = x and time 1 : x(1) = T(x) the following Ordinary Differential Equation (ODE): 


N 
i Ww (k) = =e 
i=) Wik(x)log A e) Similarly to the LE polyaffine framework, 7; and T, ! are 
expressed respectively as the exponential of the right hand side of the ODE, and the 
exponential of its opposite. 


3 Evaluation Methodology 


4 Results 


To evaluate our method, we have used a database of 105 CT images of patients delineated 
for head and neck radiotherapy following the guidelines provided in [7]. Segmented 
structures included 12 structures: lymph nodes II, III and IV, parotids and sub-mandibular 
glands on each side as well as the spinal cord and the brainstem. On this database, we have 
repeated a Leave-One-Out approach, each time picking out one patient from the dataset of 
images. The average atlas is then built from the remaining images. We used this framework 
to compare two segmentation methods: the average atlas-based segmentation, and the 
locally most similar image based segmentation. The Frankenstein's creature was built by 
defining on the average atlas a region of interest for each structure. This gives a total of 12 
selected images, combined together into a single composite patient. This image is then 
registered on the left-out patient to get its segmentation. 


The results were then compared to the manual delineations of the left-out patient using two 
voxel-based overlap measures: sensitivity (rate of true detection of the structure) and 
specificity (rate of true detection of the background). As they were delineated for 
radiotherapy, some structures were not available. We have therefore used the Leave-One- 
Out evaluation on a subset of 58 patients which had 8 or more manual delineations. Finally, 
the separation between some structures (lymph nodes, brainstem and spinal cord) are made 
on an arbitrary axial plane, based on possibly moving anatomical landmarks. This may lead 
to errors in the separations of the automatic segmentations and artificially low quality 
measures for all methods. We addressed this by evaluating together the brainstem and spinal 
cord, and by grouping the lymph nodes on each side. 


4.1 Qualitative Evaluation 


We first present in Fig. 1 the visual comparison of the average atlas and locally most similar 
image for one patient. This example illustrates very well that the average atlas anatomy may 
be significantly different from the patient after a global registration. This may result in 
registration discrepancies and in erroneous segmentations. This is particularly visible in the 
lymph nodes areas (see axes in the images) where the patient is much more corpulent than 
the atlas. The composite patient is much closer visually to the patient. The deformations 
between these two images will therefore be easier to recover and this will contribute to 
minimize the registration errors. 


We then present in Fig. 2 the qualitative evaluation of the segmentation results, using the 
Leave-One-Out framework. This figure first shows that the atlas-based segmentations 
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(images (b,e)) are overly large when compared to the manual ones. This is due to the inter- 
expert segmentation variabilities when creating the average segmentations, resulting in 
overly large segmentations in the atlas itself. This over-segmentation almost disappears 
using our approach. Only a single manual segmentation is indeed used for each structure, 
leading to more accurate segmentations, particularly on the lymph nodes areas. Finally, there 
are still differences in some regions (see arrow in image (f)). These differences are due to 
the local specificities of the selected manual segmentation, induced by the inter-expert 
segmentation variability. 


4.2 Quantitative Evaluation 


We finally present in this section the quantitative evaluation of the results using the Leave- 
One-Out framework described in section 3. We present in Table 1 the average quantitative 
results (sensitivity and specificity) computed using the Leave-One-Out framework on 58 
patients. We also indicate the number of structures on which each average was performed. 
The patients in the database were indeed not totally segmented manually and we therefore 
computed the quantitative values for the available structures. 


This table shows an important improvement of the specificity measure in the locally most 
similar method with respect to classical atlas-based segmentation. This confirms the 
observations made in the qualitative results as this measure increases as the over- 
segmentation of the structures with respect to the manual segmentations decreases. 
However, the sensitivity is lower in the locally most similar case. This is mainly due to the 
inter-expert segmentation variabilities illustrated in Fig. 2. Nevertheless, these results are 
very promising and show that the locally most similar image allows to obtain an atlas whose 
anatomy is close to the patient to delineate and to remove the over-segmentation. 


5 Conclusion 


We have presented a new method to select, up to an affine transformation, the locally most 
similar images associated to regions of interest predefined on a precomputed average atlas. 
This is based on the use of a Log-Euclidean distance between transformations obtained 
through atlas construction and the transformation to register the patient on the atlas. As the 
atlas is pre-computed, the selection method is very efficient, requiring only one non linear 
registration. We have then associated to this selection a novel framework to build from these 
images a composite patient to be used as a template for the patient segmentation. 


This method was validated by comparing it to atlas-based segmentation on a subset of 58 CT 
images among a database of 105 head and neck patients. The segmentations are not over- 
segmented anymore using our approach. This is seen both on qualitative and quantitative 
results (specificity). This method has a great interest and could also be applied to many other 
regions where large variabilities may be seen in the patients anatomies, such as the abdomen 
region. 


We have seen in our experiments that the sensitivity results are corrupted by a large intra- 
and inter-expert segmentation variability. It is also partially responsible for the overly large 
segmentations in the average atlas. In the future, we aim at studying further this 
segmentation variability by computing locally, in the average atlas reference frame, the 
changes of each delineation, for example using the Staple algorithm [14]. This would be of 
great interest to reduce the variability inuence on the segmentation process. 


Finally, we will study in the future other selection criterions, such as intensity based 
comparisons between the images, and compare their performance on the selection of the 
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locally most similar image. We will also study how to combine these different criterions to 
obtain a very robust selection criterion. 
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Fig. 1. Example of Computed Piecewise Most Similar Image 
Comparison between the patient image (a), (d) ; the average atlas (b), (e) and the locally 
most similar image (c), (f). All images are globally registered on the patient's image. 
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Fig. 2. Qualitative atlas and piecewise most similar image based segmentation results 
Comparison between the manual segmentations (a), (d) ; the atlas-based segmentations (b), 
(e) and the piecewise most similar image segmentations (c), (f). 
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